Using Naı̈ve Text Queries for Robust Audio Information Retrieval

نویسندگان

Samuel Kim

Panayiotis Georgiou

Shrikanth Narayanan

Shiva Sundaram

چکیده

The goal of this work is to build an audio information retrieval system which provides users with flexibility in formulating their queries: from audio examples to naı̈ve text. Specifically, the focus of this paper is on using naı̈ve text to create input queries describing the desired information of the users. Using naı̈ve text queries, however, raises interoperability issues between annotation and retrieval processes due to the wide variety of available audio descriptions. In this paper, we propose an intermediate audio description layer (iADL) to solve the interoperability issues between the annotation and retrieval processes. The iADL comprises two axes corresponding to semantic and onomatopoeic descriptions based on human-to-human communication experiments on how humans express sounds verbally. Various text modeling schemes, such as latent semantic analysis (LSA) and latent topic model, are utilized to transform the naı̈ve text onto the proposd iADL.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using naïve text queries for robust audio information retrieval

متن کامل

A system for spoken query information retrieval on mobile devices

We present a system which allows the user to search for information on mobile devices using spoken natural language queries. This is the first work that we are aware of which evaluates spoken query based information retrieval on a commonly available and well researched text database, the Chinese news corpus used in National Institute of Standards and Technology (NIST)’s TREC-5 and TREC-6 confer...

متن کامل

A new term-weighting scheme for naïve Bayes text categorization

Purpose – Automatic text categorization has applications in several domains, for example e-mail spam detection, sexual content filtering, directory maintenance, and focused crawling, among others. Most information retrieval systems contain several components which use text categorization methods. One of the first text categorization methods was designed using a naı̈ve Bayes representation of the...

متن کامل

Mandarin-English Information (MEI)

Mandarin-English Information (MEI) is one of the four projects selected for the Johns Hopkins University Summer Workshop 2000. We plan to develop technologies for using written queries to search spoken documents (cross-media) between English and Mandarin Chinese (cross-language). Our research focus is on the integration of speech recognition and machine translation technologies in the context o...

متن کامل

Modeling music and words using a multi-class naı̈ve Bayes approach

We propose a query-by-text system for modeling a heterogeneous data set of music and words. We quantitatively show that our system can both annotate a novel song with semantically meaningful words and retrieve relevant unlabeled songs from a database given a text-based query. We explain two feature extraction methods useful for summarizing the audio content of a song. We describe a supervised m...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Using Naı̈ve Text Queries for Robust Audio Information Retrieval

نویسندگان

چکیده

منابع مشابه

Using naïve text queries for robust audio information retrieval

A system for spoken query information retrieval on mobile devices

A new term-weighting scheme for naïve Bayes text categorization

Mandarin-English Information (MEI)

Modeling music and words using a multi-class naı̈ve Bayes approach

عنوان ژورنال:

اشتراک گذاری